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DETECTION  AND  RECOGNITION 


MODELS  OF  DOLPHIN  SONAR  SYSTEMS 


Whitlow  W.L.  Au 


Naval  Ocean  Systems  Center 
Kailua,  Hawaii  96734 


INTRODUCTION 

Research  has  shown  that  dolphins  possess  highly  sophisticated  sonar 
capabilities.  Several  res^iews  and  papers  presented  at  the  second  inter¬ 
national  meeting  on  Animal  Sonar  Systems  (Busnel  and  Fish,  1980}  elucidated 
the  sonar  capabilities  of  dolphins.  Nactigall  (1980)  discussed  object  size, 
shape  and  material  composition  discrimination  by  echolocating  odontocetes. 
Murchison  (1980)  presented  data  on  target  detection  and  range  resolution 
capabilities  of  Tursiops  tiuncatus.  Au  (1980)  discussed  signals  used  by 
Tursiops  in  open  waters ,  Data  on  target  recognition  of  cylinders  with  varying 
wall  thicknesses  and  material  composition  (Au  and  Hammer,  1980)  and  on 
sphere -cylinder  discrimination  (Au  et  al. ,  1980)  were  also  presented.  These 
and  other  studies  indicate  that  dolphin  sonars  are  superior  to  any  man-made 
sonar  systems  for  short  ranges  (two  to  three  hundred  meters) ,  shallow  water 
(typical  of  bays,  inlets,  and  coastal  waters) .  The  dolphin  sonar  may  be 
considered  the  premier  sonar  for  the  detection  and  recognition  of  slow  moving 
or  stationary  targets  in  shallow  waters  where  the  reverberation  and  noise' 
background  levels  are  high. 

The  objective  of  this  paper  is  to  examine  the  dolphin  sonar  system  from 
theoretical  and  empirical  prospectives.  Appropriate  ideas  and  models  which 
may  improve  our  understanding  of  the  echolocation  process  will  also  be 
considered.  Human  listerning  results  using  simulated  dolphin  echolocation 
signals  to  insonify  targets  will  also  be  used  to  gain  insights  into  available 
target  cues  on  which  target  recognition  and  discrimination  may  be  made  by 
echolocating  dolphins. 


I .  APPLICATION  OF  THE  SONAR  EQUATION 

The  noise- limited  form  of  the  sonar  equation  can  be  used  to  describe  the 
target  detection  performance  of  a  sonar.  It  equates  the  detection  threshold 
(DT)  to  the  echo-to-noise  ratio  when  the  target  is  just  being  detected,  and 
can  be  expressed  in  dBs  as  (Urick,  1983) 

DT  -  SL  -  2TL  +  TS  -  (NL  -  DI) 

Echo  level  Noise  level 


(1) 


where ; 


SL  -  source  level 

XL  -  transmission  loss 

TS  -  target  strength 

NL  -  background  noise  level 

DI  -  receiving  directivity  index 


The  equation  Is  written  in  terms  of  intensity  or  the  average  acoustic 
power  per  unit  area.  However,  when  dealing  with  transient -like  signals 
such  as  dolphin  signals,  echoes  may  be  many  times  longer  than  the  trans¬ 
mitted  signal.  An  example  of  a  simulated  dolphin  echolocation  signal  and 
an  echo  from  a  7.62-cm  diam.  water-filled  sphere  are  shown  in  Fig.  1. 
Therefore,  eq.  1  should  be  transformed  to  a  more  generalized  form  involv¬ 
ing  energy  flux  density  (Urick,  1983) .  This  can  be  achieved  by  using  the 
relation  between  intensity  (X)  and  energy  flux  density  (E)  and  the 
definition  of  TS,  _ 


T  _  1  /■  P^(t)  dt  _  E 

T  J  pc  T 


(2) 


0 


TS  -  10  Log  echo  intensity  1  m  from  target  (3) 

incident  intensity 


Using  eq.  2,  the  source  level  term  of  eq.  1  can  now  be  written  as 


SL  -  10  Log  I  -  SE  -  10  Log  (4) 

where  the  source  energy  flux  density  SE  -  10  Log  E  and  Ti  is  the  duration 
of  the  projected  signal.  Similarly,  TS  of  eq.  4  can  be  rewritten  as 

TS  -  TSg  -  10  Log  (  Tg/  Tj^)  (5) 

where  TSg  is  the  target  strength  based  on  the  ratio  of  the  energy  in  the 
echo  over  the  incident  energy,  and  ^  is  the  duration  of  the  echo. 

However,  if  we  assume  that  dolphin  detect  signals  in  noise  like  an  energy 
detector  having  a  specific  integration  time  of  then  should  be 

used  in  place  of  in  eq.  5.  Substituting  eqs.  4  and  5  into  eq.  1,  we  get 
the  transient  form  of  the  sonar  equation  applicable  to  a  dolphin 

DTg  -  DT  -  10  Log  Tint  -  SE  -  2TL  +  TS  -  (NL  -  DI)  (6) 

The  detection  threshold,  DTg  corresponds  to  the  energy- to-noise  ratio  used 
in  human  psychophysics  and  is  equal  to  10  Log(Eg/NQ) ,  where  Eg  is  the  echo 
energy  flux  density  and  Ng  is  the  noise  spectral  density  level.  DT  is  the 
signal- to-noise  ratio  used  in  sonar  engineering. 

Peak-to-peak  sound  pressure  level  (SPLpp)  rather  than  energy  flux 
density  is  normally  measured.  A  simple  relationship  between  E  and  SPLpp 

(a)  W 

INCIDENT  SIGNAL  ECHO  FROM  7.62  CM  WATER-FILLED  SPHERE 

TS  (PP)  -  -30.8  dB 


Fig.  1.  Example  of  a  simulated  dolphin  echolocation  signal  and  the  echo 
from  a  7.62-cm  diameter  water-filled  sphere. 


can  be  derived  by  letting  the  acoustic  signal  equal  A*s(t),  where  A  is 
the  peak  amplitude  and  s(t)  is  the  waveform  function  (|s(t)l  <  1).  Then, 

SPLpp(dB)  -  20  Log  (2A)  -  20  Log  A  +  6  (7) 

Epp(dB)  -  SPLpp(dB)  +  10  Log  (/Ts2(t)  dt)  -  6.  (8) 

0 

The  Log  integral  term  does  not  vary  much  for  dolphin  signals  in  Kaneohe 
Bay,  and  is  approximately  -52  ±  1  dB.  Equation  8  can  now  be  expressed  as 

E(dB)  -  SPLpp(dB)  -  58  (9) 

Equation  8  along  with  the  source  level  data  of  Au  (1980)  shown  in  Fig.  2 
can  be  used  to  obtain  estimates  of  SE  in  the  sonar  equation. 

Values  of  the  receiving  directivity  index,  DI,  were  calculated  by  Au 
and  Moore  (1984)  from  measured  receiving  beau  patterns  of  Tursioos.  Their 
results  are  shown  in  Fig.  3  for  frequencies  of  30,  60  and  120  kHz.  The 
linear  curve  fitted  to  the  results  can  be  expressed  as 

DI(dB)  -  16.9  Log  f(kHz)  -  14.5  (10) 

Au  and  Penner  (1981)  determined  the  detection  threshold  for  two 
Tursiops  using  the  maximum  SE  per  trial.  Large  fluctuations  in  SE  occur¬ 
red  in  most  click  trains  and  it  was  not  clear  which  clicks  were  used  by 
the  dolphins  for  target  detection.  Therefore,  conservative  estimates  of 


Fig.  2.  Peak- co-peak  source  level  of  dolphin  echolocaCion  signals 

versus  total  loss  for  five  animals  performing  different  tasks. 


DI'e  representing  the  best  S/N  available  to  the  dolphin  were  nade  using  the 
largest  SE.  The  average  SE  per  trail  was  typically  4  to  5  dB  below  the 
maximum  SE.  Au  (this  volxime)  Included  the  results  of  three  other  experi¬ 
ments  with  detection  thresholds  (Dig)  between  7.2  and  12.7  dB. 

In  comparing  detection  sensitivities  between  humans  and  dolphins  we 
need  to  be  aware  that  detectability  of  a  signal  is  a  function  of  the  sig¬ 
nal  duration,  frequency,  and  bandwidth,  and  the  psychophysical  testing 
procedure.  Two -alternative  forced-choice  (2AFC)  experiments  with  humans 
subjects  indicate  DTg  of  7.9  dB  (Green  et  al.,  1957)  to  5  dB  (Jeffress, 
1968) .  McFadden  (1968)  using  a  Yes/No  paradigm  measured  a  DTg  of  10  dB 
for  a  400  Hz,  125-msec  signal  in  continuous  and  burst  noise.  Martin  and 
Au  (1986),  using  a  Yes/No  paradigm  and  simulated  dolphin  signals  time 
streched  into  the  human  auditory  range  obtained  DTg  of  4.5  dB.  Dolphin 
and  human  signal  detection  thresholds  in  noise  seem  to  be  similar  despite 
differences  in  the  methods  and  signals  used. 


II.  DOLPHIN  SONAR  MODELED  AS  AN  ENERGY  DETECTOR 

The  threshold  versus  signal  duration  and  the  critical  ratio  experi¬ 
ments  of  Johnson  (1968a, b)  with  Tursiops  truncatus  indicate  that  the 
animal's  inner  ear  functions  like  the  human  inner  ear  and  that  the  animal 
integrates  acoustic  energy  in  the  same  way  as  humans.  Green  and  Swets 
(1966)  have  shown  that  an  energy  detector  is  a  good  analogue  of  the  human 
auditory  detection  process.  Therefore,  it  seems  reasonable  to  approach 
the  dolphin  auditory  detection  process  as  an  energy  detector.  Au  and 
Moore  (This  volume)  used  an  electronic  "phantom  target"  to  study  signal 


Fig.  3.  Receiving  directivity  index  for  Tursiops  truncatus  as  a 
function  of  frequency  (From  Au  and  Moore,  1984). 


detection  processes  of  a  dolphin.  One,  two  or  three  replicas  of  each 
click  enltted  were  projected  back  to  the  anlnal.  All  pulses  were  within 
the  integration  time  of  the  dolphin  (Au  and  Moore,  this  volume).  Their 
results  are  shown  in  Fig.  4,  along  with  an  energy  detector  response  curve. 
The  dolphin's  performance  followed  the  response  of  an  energy  detector. 

Urkowltz  (1967)  examined  the  detection  of  a  deterministic  signal  in 
white  Gaussian  noise  using  an  energy  detector  and  derived  expressions  for 
the  correct  detection  and  false  alarm  probabilities.  The  probability  of 
a  false  alarm  for  a  given  threshold  Vj  is  given  by 

P(FA)  -  1  -  Pr(Vx  <  x22tw])  (11) 

where  Pr  is  the  area  under  the  chi-square  distribution  curve  with  2TW 
(time -bandwidth)  degrees  of  freedom.  For  the  same  threshold  level  Vj,  the 
probability  of  a  correct  detection  is  given  by 

P(D)  -  1  -  Pr(Vx/G  <  X^d)  (12) 

where:  D  -  (2TW  +  E/No)V(2TW  +  2E/No)  (13) 

G  -  (2TW  +  2E/No)/(2TW  +  E/Nq) .  (14) 

Pr  is  now  the  area  under  the  noncentral  chi-square  distribution  with  a 
modified  number  of  degrees  of  freedom  D  and  a  threshold  divisor  G. 

These  expressions  derived  by  Urkowitz  (1967)  were  applied  to  dolphin 
detection  in  noise  data  by  assuming  an  unbiased  detector  in  determining 
the  probability  of  a  correct  response  P(C)  from  P(FA)  and  P(D) .  The 
calculation  was  done  by  first  choosing  desired  values  of  P(FA)  and  2TW  and 
then  determining  Vx  by  an  iterative  procedure.  Then  with  the  iterated 
value  of  Vx, 


12  3  4 

NO.  ECHO  PULSE  WITHIN  INTEGRATION  TIME 

Fig. A.  Dolphin  detection  threshold  in  noise  versus  number  of  pulses 
within  an  integration  time  (From  Au  andMoore,  this  volume). 


P(D)  was  calculated  for  different  values  of  E/Nq.  The  procedure  was  con¬ 
tinued  for  different  2TU  degrees  of  freedom,  until  the  values  of  F(C)  were 
obtained  as  a  function  of  E/Nq  which  best  fitted  the  dolphin  data.  The 
performance  data  for  Tursions  detecting  targets  in  masking  noise  in  three 
different  studies  are  shown  in  Fig.  5  along  with  the  results  of  Urkowitz 
energy  detection  model  for  2TW  -  22. 

Urkowitz  energy  detection  model  agrees  well  with  the  dolphins'  per¬ 
formance  results,  further  supporting  the  notion  of  the  dolphin  being  an 
energy  detector.  The  unbiased  detector  assumption  used  to  derive  P(C}  is 
a  good  one  for  signal- to -noise  conditions  at  or  above  the  75%  correct 
threshold.  Tursioos  tends  to  be  tmbias  for  hi^  signal- to-noise  condi¬ 
tions  (Au  and  Synder,  1980;  Au  and  Penner,  1981;  Au  and  Moore,  this 
volume) . 

McGill  (1968)  considered  the  detection  of  a  signal  known  except  for 
phase  using  an  energy  detector  in  a  2AFC  procedure.  Au  and  Penner  (1981) 
applied  McGill's  expression  to  a  Yes/No  paradigm  by  determining  d' (2AFC) 
from  the  tables  of  Elliot  (1964) ,  and  by  using  the  relation  d' (2AFC)  -  2 
d'(Y/N)  (Tanner  and  Sorkin,  1972).  The  values  of  P(C)  was  then  calculated 
as  a  function  of  d' (Y/N)  assuming  an  unbiased  detector.  McGill's  formula 
also  fitted  the  dolphins'  performance. 


Ill  TARGET  RECOGNITION  AND  DISCRIMINATION 


Hammer  and  Au  (1980)  examined  targets  used  in  a  recognition  experiment 
with  simulated  dolphin  signals.  Echoes  were  passed  through  a  matched 
filter  having  the  transmitter  signal  as  the  reference.  The  envelopes  of 
the  matched  filter  provided  information  to  explain  the  dolphin's  perform¬ 
ance.  Times  of  arrival  and  shapes  of  the  matched  filter  envelope  for  the 
different  echo  highlights,  especially  the  second  component,  were  found  to 
be  important.  They  suggested  that  dolphins  may  use  time -separation  pitch 
(TSP)  cues  caused  by  the  interaction  of  highly  correlated  highlights. 


.O 

O' 

UJ 

(/) 

z 

o 

Q. 

(/) 

UJ 

(X 

K 

O 

UJ 

QC 

CC 

O 

O 


-5  0  5  10  15  20 


10  LOG  [Ee/No]MAX  (dB) 


Fig.  5.  Dolphin  performance  results  and  Urkowitz  energy  detector  model. 


Altes  (1980)  proposed  a  viable  dolphin  echolocation  nodel  which  in¬ 
volved  the  time- frequency  energy  density  or  spectrograa  of  echoes.  The 
time -frequency  energy  density  function  contains  information  on  both  high¬ 
light  distribution  and  energy  spectrum.  Target  detection,  recognition, 
and  discrimination  would  be  achieved  by  correlating  the  spectrogram  of 
echoes  with  previously  measured  spectrograms  of  known  targets.  Altes 
spectrogram  model  seems  consistent  with  the  TSP  model  of  Hammer  and  Au 
(1980),  and  with  the  use  of  energy  detection. 

Martin  and  Au  (1982,  1986)  took  a  different  approach  in  studying 
target  discrimination  by  dolphins.  They  performed  experiments  using  the 
excellent  discrimination  and  pattern  recognition  capabilities  of  the  human 
auditory  system  to  analyze  target  echoes  from  the  same  targets  used  in 
dolphin  experiments.  Fish  et  al.  (1976)  previously  demonstrated  that 
instrumented  human  divers,  listening  to  echoes  produced  by  projecting 
dolphin- like  signals,  could  discriminate  the  thickness  and  material  compo¬ 
sition  of  metallic  plates  as  well  as  or  better  than  dolphins.  The  echoes 
were  time -stretched  into  the  human  auditory  range.  Martin  and  Au  (1982, 
1986)  used  a  similar  technique  but  performed  human  listening  experiments 
in  a  sound  booth.  Their  specific  objective  was  to  determine  what  acoustic 
cues  were  available  for  discrimination. 

Targets  were  examined  acoustically  using  a  monostatic  sonar  system 
(see  Au  and  Synder,  1980)  that  projected  a  broadband  simulated  dolphin 
signal.  The  echoes  were  digitized  at  a  sample  rate  of  1  MHz  and  stored  on 
magnetic  tape.  The  digitized  echoes  were  played  back  to  hvunans  at  a 
sample  rate  of  20  Khz,  which  effectively  stretched  the  echoes  by  a  factor 
of  50  and  lowered  the  frequency  by  a  factor  of  50.  For  each  trial,  one  of 
ten  echoes  was  presented  to  the  subjects  at  a  rate  of  4  echoes  per  second. 
The  specific  echo  in  a  set  of  ten  echoes  per  target  was  randomly  selected 
for  each  trial. 


A.  Discrimination  of  cylindrical  targets 

Human  subjects  could  easily  discriminate  standard  aliuninum  cylinders 
from  probe  targets  used  in  the  general  discrimination  and  material  compo¬ 
sition  experiment  of  Hammer  and  Au  (1980).  For  the  general  discrimination 
task,  the  two  standard  aluminum  cylinders  had  outer  diameters  of  3.81  cm 
and  7.62  cm  with  wall  thicknesses  of  0.64  and  0.95  cm,  respectively.  The 
probe  targets  are  described  in  Table  1.  The  discrimination  task  was  found 
to  be  trivial  even  though  some  of  the  probes  were  aluminvun  cylinders , 
solid  and  hollow. 

In  the  material  composition  discrimination  task,  subjects  had  to  dis¬ 
criminate  two  aluminum  standards  from  targets  made  of  steel,  bronze  and 
glass.  The  targets  had  one  of  two  wall  thicknesses,  0.32  and  0.40  cm  and 
outer  diameters,  3.81  and  7.62  cm,  respectively.  The  dolphin  could  dis¬ 
criminate  the  aluminum  standards  from  the  bronze  and  steel  probes  but 
classified  the  glass  probes  with  the  aluminum  standards.  The  average 

Table  1.  Probe  targets  used  in  the  general  discrimination  experiment. 


Reference 

IPl 

IP2 

IP3 

IP4 

IP5 

IP6 

IP7 

IPS 

Composition 

Al 

CPN 

Al 

CPN 

Al 

CRK 

Al 

PVC 

Wall  (cm) 

0.48 

solid 

solid 

solid 

0.64 

solid 

solid 

0.79 

O.D.  (cm) 

6.35 

6.35 

3.81 

4.06 

11.43 

11.43 

7.62 

7.62 

performance  of  three  human  subjects  in  the  aluminum  versus  steel  and 
altiminuffl  versus  bronze  was  98%  and  95%  correct,  respectively.  Subjects 
first  determined  whether  an  echo  originated  from  a  large  or  small  cylinder 
based  on  duration  and  TSP  cues.  Echoes  from  the  large  cylinders  had  lower 
TSP  and  longer  durations.  Subjects  reported  that  the  small  aluminum  had 
lower  TSP  than  the  small  bronze  cylinder  and  the  large  aluminum  had  a  TSP 
cue  whereas  the  large  bronze  did  not.  Examples  of  the  target  echoes  are 
shown  in  Fig.  6  for  the  small  cylinders.  One  can  see  that  the  aluminum 
should  have  a  higher  TSP  than  the  small  bronze  target.  The  aluminum  versus 
steel  discrimination  was  based  on  hearing  clearly  perceptible  TSP  with  the 
aluminum  targets  and  less  perceptible  TSP  with  the  steel  targets. 


Schusterman,  et  al.  (1980)  trained  the  dolphin  used  by  Hammer  and  Au 
(1980)  to  discriminate  between  the  small  aluminum  and  glass  targets  using 
a  two -alternative  forced-choice  paradigm.  They  were  not  able  to  train 
the  animal  to  discriminate  between  the  large  aluminum  and  glass  cylinders. 
In  the  human  listerning  experiment,  four  subjects  discriminated  between 
the  aluminum  and  glass  targets  with  performance  accuracy  varying  from  74% 
to  94%  correct.  The  main  discrimination  cue  was  difference  in  echo  dura¬ 
tions  between  the  aluminum  and  glass  echoes.  Typical  examples  of  the 
echoes  from  the  targets  are  shown  in  Fig.  7.  The  glass  echoes  damped  out 
approximately  14  and  5  ms  before  the  aluminum  echoes  for  the  small  and 
large  targets,  respectively.  The  duration  difference  may  not  have  been 
perceptible  to  the  dolphin,  but  could  be  perceived  by  humans  as  a  result 
of  the  time  expansion  of  50.  The  ambient  noise  was  not  a  problem  for  the 
dolphin  since  it  typically  operated  with  E/Nq  in  excess  of  40  to  50  dB. 


The  duration  cues  for  the  large  aluminum  versus  glass  discrimination 
were  examined  further  by  truncating  the  echoes  between  groups  of  high¬ 
lights,  indicated  by  the  tick  marks  in  Fig.  7.  The  truncation  caused  the 
signals  to  be  of  equal  duration,  eliminating  any  duration  cues.  The 
performance  of  two  subjects  as  a  function  of  the  signal  length  is  shown  in 
Fig.  8.  Discrimination  accuracy  decreased  as  the  signals  became  shorter. 
The  final  truncation  eliminated  all  but  the  first  two  echo  components,  yet 
the  subjects  were  able  to  discriminate  the  signals  above  70%  correct.  The 

ECHO  FREOUENCV  SPECTRUM  ENVELOPE  OF  MATCHED  FILTER  RESPONSE 


Fig.  6.  Backscatter  measurements  of  the  3.81-cm  cylinders.  Highlight 
arrival  times  are  indicated  on  the  matched  filter  results. 
(From  Hammer  and  Au,  1980). 
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Fig.  7.  Echoes  from  7.62-cm  aluminum  (solid)  and  glass  (dashed)  cylin¬ 
ders  (From  Martin  and  Au,  1986). 

time  between  the  first  and  second  echo  components  was  virtually  the  same 
for  both  targets.  Thus,  the  discrimination  was  based  on  cues  other  than 
TSP  differences.  The  subjects  indicated  that  the  glass  target  had  a 
slightly  higher  "click  pitch"  than  the  aluminum.  This  cue  was  difficult  to 
extract  and  was  not  always  reliable. 

B.  Discrimination  of  Cylindrical  Target  in  Noise 

Martin  and  Au  (1986)  next  performed  material  composition  discrimina¬ 
tion  in  noise.  The  results  are  shown  in  Fig.  9  for  the  aluminum  versus 
glass  discrimination.  For  the  3.81-cm  cylinders,  E/Nq  had  to  be  at  least 
10  dB  greater  than  the  detection  threshold  in  noise  before  a  subject  could 
discriminate  above  75%  correct.  For  the  7.62-cm  cylinders,  E/Nq  had  to  be 
at  least  30  dB  greater  than  the  detection  threshold.  Subject  NN  gained  a 
sudden  insight  into  discriminating  the  7.62-cm  cylinders  at  the  25  and  30 
dB  signal- to-noise  ratio.  In  general,  learning  the  discrimination  was 
insightful,  as  subjects  began  to  utilize  different  cues. 

C.  Sphere  -  Cylinder  Discrimination 

Human  discrimination  between  spheres  and  cylinders  was  measured  using 
foam,  solid  aluminum  and  water-filled  steel  targets.  Au  et  al.  (1980) 
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Human  discrimination  of  7.62-cm  aluminvim  and  glass  cylinders 
versus  signal  duration  (From  Martin  and  Au,  1982). 


Fig.  9.  Aluminum  versus  glass  discrimination  results  as  a  function  of 
the  signal  energy- to-noise  ratio  (From  Martin  and  Au,  1986). 


demons traced  that  an  echolocatlng  dolphin  could  discriminate  between  foam 
spheres  and  cylinders.  They  speculated  Chat  echoes  returning  via  a  sur¬ 
face  reflected  path  were  larger  in  amplitude  for  spheres  than  for  cylin¬ 
ders.  However,  when  the  surface  reflected  echo  component  was  blocked  with 
an  absorbent  "horsehair”  mat,  Che  dolphin  still  performed  the  discrimina¬ 
tion.  The  same  foam  targets  were  also  used  with  humans.  Discrimination 
results  pooled  across  subjects  are  given  in  Table  2.  The  average  probabi¬ 
lity  of  correct  discrimination  varied  between  84%  and  96%.  The  windowed 
results  indicated  that  surface  echo  components  did  provide  cues  but  both 
the  dolphin  and  humans  could  perform  the  discrimination  Cask  without  these 
cues . 

Subjects  reported  that  two  cues  used  for  discrimination  were  a  higher 
pitch  associated  with  cylinder  echoes  and  the  presence  of  low-frequency 
reverberation  in  the  sphere  echoes.  The  target  strength  of  a  cylinder 
increases  with  frequency  and  is  constant  with  frequency  for  a  sphere 
(Urick,  1983).  An  example  of  echoes  from  a  foam  sphere  and  cylinder  is 
shown  in  Fig.  10.  The  sphere  echo  has  a  low  amplitude  portion  following 
the  main  reflection;  the  cylinder  echo  does  not.  The  spectrum  for  the 
cylinder  has  more  energy  than  the  sphere's  at  higher  frequencies. 

Perfoirmance  of  two  subjects  in  the  solid  aluminum  sphere -cylinder  and 
the  water-filled  steel  sphere-cylinder  was  between  94%  and  100%  corrset. 
Subjects  reported  that  the  metal  target  echoes  did  not  sound  like  foam 
echoes.  However,  the  same  discrimination  cues  were  used:  a  higher  click 
pitch  for  cylinders,  and  more  reverberation  for  spheres. 

D.  Target  Detection  in  Reverberation 

The  clutter  screen  experiment  of  Au  and  Turl  (1983)  was  examined  with 
the  targets  in  the  plane  of  the  clutter  screen.  The  human  and  dolphin 
performance  as  a  function  of  the  peak-to-peak  echo- to -reverberation  ratio 
are  shown  in  Fig.  11.  The  cue  used  by  the  humans  was  the  presence  of  a 
"click"  sound  from  the  targets.  The  echoes  from  the  clutter  screen 
sounded  diffused,  whereas  the  echoes  from  the  aluminum  cylinders  sounded 
like  compact  clicks  with  a  definite  TSP.  Subjects  learned  to  integrate 
only  over  the  duration  of  the  target  echo. 


Table  2.  Foam  sphere  versus  cylinder  dolphin  discrimination  results. 

The  windowed  results  refer  to  echoes  which  had  the  air-water 
surface  reflected  components  truncated. 
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Fig.  10.  Echoes  from  foiim  sphere  (solid)  and  cylinder  (dotted).  (From 
Martin  and  Au,  1986) . 

E.  Aspect  Independent  Discrimination  of  Cylinders 

The  final  experiment  involved  a  material  composition  discrimination  of 
cylinders  at  various  aspects.  The  procedure  involved  training  the  dolphin 
to  discriminate  between  two  targets  at  the  baseline  aspects  of  0,  45,  and 
90°,  where  0°  was  the  broadside  aspect.  After  the  dolphin  could  perform 
the  discrimination  at  90%  correct  for  the  baseline  aspects,  probe  trials 
at  different  aspects  was  used.  In  the  first  discrimination  task,  aluminum 
versus  steel,  the  dolphin  performance  did  not  reach  the  90%  criterion  for 
45  and  90°  aspect.  An  easier  discrimination  of  aluminum  versus  coral  rock 
was  then  chosen.  The  dolphin's  results  are  shown  in  Table  3,  which  in¬ 
cludes  both  the  baseline  (shaded)  and  probe  results. 

A  similar  procedure  was  used  with  the  humans  but  with  a  wider  variety 
of  targets.  Subjects  were  trained  to  discriminate  between  pairs  of  tar¬ 
gets  for  the  baseline  angles.  After  achieving  near  100%  correct  perform¬ 
ance  on  the  baseline  angles,  sessions  were  made  more  complex  by  presenting 
echoes  from  seven  aspects,  0,  15,  30,  45,  60,  75  and  90°.  The  human 
results  pooled  over  the  four  subjects  are  shown  in  Table  4.  As  with  the 
dolphin,  the  humans  could  perform  the  aluminum  versus  rock  discrimination 
readily  but  had  problems  with  the  aluminum  versus  steel  discrimination 
(4th  column  of  Table  4).  When  the  subjects  were  given  the  possibility  of 
having  any  one  of  seven  aspects  presented  for  a  given  trial,  performance 
at  the  baseline  angles  deteriorated.  No  specific  cues  could  be  focused  on 
in  this  experiment  since  the  echoes  were  so  complex.  However,  the  human 


Table  3.  Dolphin  discriainatlon  results  as  a  function  of  aspect  angle 
of  the  targets  (From  Martin  and  Au,  1986). 
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subjects  Indicated  that  echoes  at  the  45^  aspect  angle  contained  the  most 
information  pertinent  to  the  other  angles. 


IV.  DISCUSSION  AND  CONCLUSIONS 

An  echolocating  dolphin's  ability  to  detect  target  echoes  in  noise  and 
to  make  fine  discrimination  of  target  features  seems  similar  to  human 
auditory  detection  and  pattern  recognition  capabilities.  Although  human 


Fig.  11.  Target  detection  in  clutter  performance  results  for  humans  and 
dolphin  (From  Martin  and  Au,  1986), 


Table  4.  Human  dlscriminacion  results  as  a  function  of  target  aspect 
angle  for  different  target  pairs.  The  values  In  each  column 
are  the  percent  correct  results  In  Identifying  a  particular 
target  In  a  target  pair  (From  Martin  and  Au,  1986). 
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hearing  experiments  were  performed  differently  and  with  different  stimuli, 
detection  thresholds  of  dolphin  and  man  are  comparable.  The  question  that 
comes  immediately  to  mind  is  "why  does  the  dolphin  f)nar  perform  better 
than  man*made  sonars  in  shallow  water  environments?"  There  are  several 
factors  inherent  in  dolphins  that  may  provide  advantages  over  man-made 
sonars.  A  dolphin  is  a  highly  mobile  aquatic  mammal  that  is  capable  of 
using  its  sonar  while  in  motion,  going  to  different  depths  and  locations, 
and  looking  at  objects  from  various  aspects.  Reverberation  and  noise 
observed  from  one  aspect  may  be  different  when  observed  at  other  aspects, 
and  locations.  The  dolphin  may  also  have  good  long  teirm  auditory  and 
spatial  memory  which  would  be  effective  in  recognizing  desired  targets  in 
specific  positions  and  in  spatial  pattern  recognition  of  echoes  from 
various  locations. 

Most  man-made  active  sonars  have  been  designed  to  eliminate  the  human 
auditory  capabilities  from  the  system.  Therefore,  the  excellent  analysis 
and  pattern  recognition  capabilities  of  the  human  auditory  system  are 
ignored.  When  a  listening  capability  is  included  in  an  active  sonar,  less 
than  ideal  kinds  of  signals  are  often  used:  narrowband  pulse  tones  or 
continous  transmission  frequency  modulated  signals  (CTFM) .  Neither  of 
these  signals  possess  the  time  resolution  capabilities  of  broadband  tran¬ 
sient-like  pulses  that  dolphins  use. 

Human  listening  experiments  using  dolphin-like  sonar  signals  can  be 
useful  in  understanding  target  cues  and  processing  methods  needed  to 
extract  them.  The  human  listening  experiments  of  Martin  and  Au  (1982, 
1986)  indicated  that  differences  in  time-separation  pitch  associated  with 
correlated  echo  highlights  and  differences  in  echo  duration  were  the 
predominant  discrimination  cues  in  almost  all  of  the  tasks.  Duration  cues 
as  much  as  30  dB  below  the  peak  level  of  the  primary  echo  component  were 
found  to  be  important  and  useful.  Only  in  the  sphere -cylinder  and  trun¬ 
cated  aluminum- glass  discriminations  was  spectral  information  in  the  form 
of  click  pitch  an  important  cue.  These  cues  used  by  the  human  listeners 
may  be  the  same  cues  used  by  echolocating  dolphins.  It  seems  that  time 
domain  processing  of  highlight  separation  and  highlight  amplitudes  within 
echoes  may  provide  most  of  the  target  information  to  the  dolphin.  There¬ 
fore,  broadband  transient- like  echolocation  signals  with  good  time  resolu¬ 
tion  would  be  most  useful  to  dolphins  and  humans. 
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